9 research outputs found

    Using word and phrase abbreviation patterns to extract age from Twitter microtexts

    Get PDF
    The wealth of texts available publicly online for analysis is ever increasing. Much work in computational linguistics focuses on syntactic, contextual, morphological and phonetic analysis on written documents, vocal recordings, or texts on the internet. Twitter messages present a unique challenge for computational linguistic analysis due to their constrained size. The constraint of 140 characters often prompts users to abbreviate words and phrases. Additionally, as an informal writing medium, messages are not expected to adhere to grammatically or orthographically standard English. As such, Twitter messages are noisy and do not necessarily conform to standard writing conventions of linguistic corpora, often requiring special pre-processing before advanced analysis can be done. In the area of computational linguistics, there is an interest in determining latent attributes of an author. Attributes such as author gender can be determined with some amount of success from many sources, using various methods, such as analysis of shallow linguistic patterns or topic. Author age is more difficult to determine, but previous research has been somewhat successful at classifying age as a binary (e.g. over or under 30), ternary, or even as a continuous variable using various techniques. Twitter messages present a difficult problem for latent user attribute analysis, due to the pre-processing necessary for many computational linguistics analysis tasks. An added logistical challenge is that very few latent attributes are explicitly defined by users on Twitter. Twitter messages are a part of an enormous data set, but the data set must be independently annotated for latent writer attributes not defined through the Twitter API before any classification on such attributes can be done. The actual classification problem is another particular challenge due to restrictions on tweet length. Previous work has shown that word and phrase abbreviation patterns used on Twitter can be indicative of some latent user attributes, such as geographic region or the Twitter client (iPhone, Android, Twitter website, etc.) used to make posts. Language change has generally been posited as being driven by women. This study explores if there there are age-related patterns or change in those patterns over time evident in Twitter posts from a variety of English authors. This work presents a growable data set annotated by Twitter users themselves for age and other useful attributes. The study also presents an extension of prior work on Twitter abbreviation patterns which shows that word and phrase abbreviation patterns can be used toward determining user age. Notable results include classification accuracy of up to 83%, which was 63% above relative majority class baseline (ZeroR in Weka) when classifying user ages into 6 equally sized age bins using a multilayer perceptron network classifier

    GRB 130831a: Rise and demise of a magnetar at z = 0.5

    Get PDF
    Open Access.--14th Marcel Grossman Meeting On Recent Developments in Theoretical and Experimental General Relativity, Astrophysics and Relativistic Field Theories; University of Rome "La Sapienza"Rome; Italy; 12 July 2015 through 18 July 2015; Code 142474.-- http://www.icra.it/mg/mg14/Gamma-ray bursts (GRBs) are the brightest explosions in the universe, yet the properties of their energy sources are far from understood. Very important clues, however, can be deduced by studying the afterglows of these events. We present observations of GRB 130831A and its afterglow obtained with Swift, Chandra, and multiple ground-based observatories. This burst shows an uncommon drop in the X-ray light curve at about 100 ks after the trigger, with a decay slope of α 7. The standard Forward Shock (FS) model offers no explanation for such a behaviour. Instead, a model in which a newly born magnetar outflow powers the early X-ray emission is found to be viable. After the drop, the X-ray afterglow resumes its decay with a slope typical of FS emission. The optical emission, on the other hand, displays no clear break across the X-ray drop and its decay is consistent with that of the late X-rays. Using both the X-ray and optical data, we show that the FS model can explain the emission after 100 ks. We model our data to infer the kinetic energy of the ejecta and thus estimate the efficiency of a magnetar “central engine” of a GRB. Furthermore, we break down the energy budget of this GRB into prompt emission, late internal dissipation, kinetic energy of the relativistic ejecta, and compare it with the energy of the accompanying supernova, SN 2013fu. Copyright © 2018 by the Editors.All rights reserved.Peer reviewe
    corecore